Optimal weighting of bimodal biometric information with specific application to audio-visual person identification

نویسندگان

  • Roland Hu
  • Robert I. Damper
چکیده

A new method is proposed to estimate the optimal weighting parameter for combining audio (speech) and visual (face) information in person identification, based on estimating probability density functions (pdf’s) for classifier scores under Gaussian assumptions. Performance comparisons with real and simulated data indicate that this method has advantages in reducing bias and variance of the estimation relative to other methods tried, so achieving a robust estimator of the optimal weighting parameter. Another contribution is that we propose the bootstrap method to compare performances of different algorithms for estimating the optimal weighting parameter, so providing a strict criterion in comparing algorithms of this kind. Using simulated data, for which the pdf is controlled and known, we show that the advantages of the method hold up when the underlying Gaussian assumption is violated. The main drawback is that we have to choose an adjustable parameter, and it is not clear how this should best be done.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Identity Authentication based on Audio Visual Biometrics: A Survey

Biometric authentication is an emerging technology that utilize biometric data for the purpose of person identification or recognition in security applications. A number of biometrics can be used in a person authentication system. Among the widely used biometrics, voice and face traits are most promising for pervasive application in every life, because they can be easily obtained using unobtrus...

متن کامل

Introducing Crossmodal Biometrics: Person Identification from Distinct Audio & Visual Streams

Person identification using audio or visual biometrics is a well-studied problem in pattern recognition. In this scenario, both training and testing are done on the same modalities. However, there can be situations where this condition is not valid, i.e. training and testing has to be done on different modalities. This could arise, for example, in covert surveillance. Is there any person specif...

متن کامل

Audio-Visual Emotion Recognition Using Semi-Coupled HMM and Error-Weighted Classifier Combination

This paper presents an approach to automatic recognition of emotional states from audio-visual bimodal signals using semi-coupled hidden Markov model and error weighted classifier combination for Human-Computer Interaction (HCI). The proposed model combines a simplified state-based bimodal alignment strategy and a Bayesian classifier weighting scheme to obtain the optimal solution for audio-vis...

متن کامل

Audio-visual person identification on the XM2VTS database

This paper presents a multimodal person identification system based on combination of audio and visual classifiers. The audio classifier was built by using mel-frequency cepstrum coefficient features and Gaussian mixture models. The visual classifier was implemented by Haar-like features and AdaBoost algorithm for face detection, and principal component analysis for identification. A new method...

متن کامل

Robust Automatic Human Identification Using Face, Mouth, and Acoustic Information

Discriminatory information about person identity is multimodal. Yet, most person recognition systems are unimodal, e.g. the use of facial appearance. With a view to exploiting the complementary nature of different modes of information and increasing pattern recognition robustness to test signal degradation, we developed a multiple expert biometric person identification system that combines info...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Information Fusion

دوره 10  شماره 

صفحات  -

تاریخ انتشار 2009